Classification and Identification of Telugu Handwritten Characters Extracted from Palm Leaves Using Decision Tree Approach
نویسندگان
چکیده
Research in character recognition is very popular for various application potentials in banks, post offices, defense organizations, reading aid for the blind, library automation, language processing and multi-media design. Even though Epigraphical work dealing with stone inscriptions have been analyzed, these have been done largely manually and also on 2D traces. A large collection of these are available in the classical Indian languages like Sanskrit, Tamil, Pali etc as well as in more modern languages like Telugu. These characters on the palm leaf have the additional properties like depth, an added feature which can be gainfully exploited in character recognition. In this paper, we explore how these 3D features can be extracted and how they can be used in the recognition and classification process. This paper describes a system to identify and classify Telugu (a south Indian language) characters extracted from the palm leaves, using Decision Tree approach. The decision tree is developed using SEE5 algorithm, which is an improvement from the predecessor ID3 and C4.5 algorithm. The identification accuracy obtained is 93.10% using this method.
منابع مشابه
Analysis of Telugu Palm Leaf Characters Using Multi- Level Recognition Approach
Palm leaf character recognition is an area which is at the nascent stage of research. Although character recognition is a well-known application of pattern recognition, lot of work is still to be exploited in handwritten character recognition. The recognition accuracy as per the literature survey for handwritten English characters is very low and for Indian languages it is just started. Researc...
متن کاملMulti-stage Strategy to Classify Handwritten Characters of Telugu
The aim of this work is to recognize handwritten characters of Indian language, Telugu. Single stage of classifying similar Telugu characters leads to low recognition rate. However similar characters of Telugu (Indian language) are recognized in two stages in the current work. Various preprocessing steps are carried out first to extract characters from the handwritten documents. The preprocesse...
متن کاملFPGA based Histogram Equalization Technique to Recognize Characters in Handwritten Scriptures of Palm Leaves
BSTRACT Multi-core processors play a vital role in the application such as image processing. In this manuscript the focus is made on such types of imaging applications where in which, the image processing is done on handwritten scriptures of palm leaves to study its mechanics of characters inscribed. Here in this paper the approach adopted is the digital image reconstruction with acquiring of t...
متن کاملArabic Text Recognition
The issue of handwritten character recognition is still a big challenge to the scientific community. Several approaches to address this challenge have been attempted in the last years, mostly focusing on the English pre-printed or handwritten characters space. Thus, the need to attempt a research related to Arabic handwritten text recognition. Algorithms based on neural networks have proved to ...
متن کاملOn-line Recognition of Arabic Handwritten Characters
In this study, a new approach for the recognition of isolated handwritten Arabic characters is presented. The proposed method places a 5x5 grid on the character to extract the features needed for the recognition step. These features are calculated based on grid calculations. Then these features are feed to the decision tree to classify the character into one of the 28 classes. The classificatio...
متن کامل